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Abstract. The paper deals with a mathematical model of a surveillance 
system based on a net of sensors. The signals acquired by each node of 
the net are Markovian process, have two different transition probabilities, 
which depends on the presence or absence of a intruder nearby. The 
detection of the transition probability change at one node should be 
confirmed by a detection of similar change at some other sensors. Based 
on a simple game the model of a fusion center is then constructed. The 
aggregate function defined on the net is the background of the definition 
of a non-cooperative stopping game which is a model of the multivariate 
disorder detection. 
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1 Introduction 

The aim of this consideration is to construct the mathematical model of a multi- 
variate surveillance system. It is assumed that there is net ^ oi p nodes. At each 
node the state is the signal at moment n E N which is at least one coordinate of 
the vector S E C 3?™. The distribution of the signal at each node has two 
forms and depends on a pure or a dirty environment of the node. The state of 
the system change dynamically. We consider the discrete time observed signal 
as m > p dimensional process defined on the fixed probability space {f2,J-,F). 
The observed at each node process is Markovian with two different transition 
probabilities (see [17] for details). In the signal the visual consequence of the 
transition distribution changes at moment 0^, i £ is a change of its character. 
To avoid false alarm the confirmation from other nodes is needed. The family of 
subsets (coalitions) of nodes are defined in such a way that the decision of all 
member of some coalition is equivalent with the claim of the net that the disorder 
appeared. It is not sure that the disorder has had place. The aim is to define the 
rules of nodes and a construction of the net decision based on individual nodes 
claims. Various approaches can be found in the recent research for description 
or modeling of such systems (see e.g. [53], [Mj). The problem is quite similar 
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to a pattern recognition with multiple algorithm when the fusions of individual 
algorithms results are unified to a final decision. The proposed solution will be 
based on a simple game and the stopping game defined by a simple game on 
the observed signals. It gives a centralized, Bayesian version of the multivariate 
detection with a common fusion center that it has perfect information about 
observations and a priori knowledge of the statistics about the possible distri- 
bution changes at each node. Each sensor (player) will declare to stop when it 
detects disorder at his region. Based on the simple game the sensors' decisions 
are aggregated to formulate the decision of the fusion center. The sensors' strate- 
gies are constructed as an equilibrium strategy in a non-cooperative game with 
a logical function defined by a simple game (which aggregates their decision). 

The general description of such multivariate stopping games has been for- 
mulated by Kurano, Yasuda and Nakagami in the case when the aggregation 
function is defined by the voting majority rule [9] or the monotone voting strat- 
egy [24] and the observed sequences of the random variables are independent, 
identically distributed. It was Ferguson [5] who substituted the voting aggrega- 
tion rules by a simple game. The Markov sequences have been investigated by 
the author and Yasuda pi] . 

The model of detection the disorder at each sensor are presented in the 
next section. It allows to define the individual payoffs of the players (sensors). 
Section |3] introduces the aggregation method based on a simple game of the 
sensors. Section H] contains derivation of the non-cooperative game and existence 
theorem for equilibrium strategy. The final decision based on the state of the 
sensors is given by the fusion center and it is described in Sectional The natural 
direction of further research is formulated also in the same section. A conclusion 
and resume of an algorithm for rational construction of the surveillance system 
is included in Section [71 



2 Detection of disorder at sensors 

Following the consideration of Sectionjl] let us suppose that the process n G 
N}, N = {0, 1,2,.. .}, is observed sequentially in such a way that each sensor, e.g. 
r (gets its coordinates in the vector at moment n). By assumption, it is a 
stochastic sequence that has the Markovian structure given random moment 0^, 
in such a way that the process after Or starts from state e^-i- The objective 
is to detect these moments based on the observation of ^„ . at each sensor 
separately. There are some results on the discrete time case of such disorder 
detection which generalize the basic problem stated by Shiryaev in [15] (see 
e.g. Brodsky and Darkhovsky pi, Bojdecki [1^, Yoshida [IS], Szajowski [30]) in 
various directions. 

Application of the model for the detection of traffic anomalies in networks has 
been discussed by Tartakovsky et al. [52] . The version of the problem when the 
moment of disorder is detected with given precision will be used here (see [T7] ) . 
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2.1 Formulation of the problem 

The observable random variables {^„}„gn are consistent with the filtration J-"„ 
(or Tn = o-(jto, jti, . . . , jtn))- The random vectors Jtn take values in (E, B), 
where E C 3?™. On the same probability space there are defined unobservable 
(hence not measurable with respect to random variables {^rjj^i which have 
the geometric distributions: 

Pidr = j) = Pi-'qr, qr = l~Pre (0, 1), J = 1, 2, . . .. (1) 

The sensor r follows the process which is based on switching between two, 
time homogeneous and independent, Markov processes {X*„}„gN, « = 0, 1, r G 91 
with the state space (E, B), both independent of {9r}™=i- Moreover, it is assumed 
that the processes have transition densities with respect to the cr-finite 

measure fi, i.e., for any B G B we have 

Plixi, eB) = P{XI, e B\xi„ = x) - / f^{y)^^{dy). (2) 

Jb 

The random processes {Xm}, {X^^^} and the random variables 9r are 

connected via the rule: conditionally on Or — k 

_rx°„, iffc>n, 

where {Xl^} is started from X^ (but is otherwise independent of X'^ .). 

For any fixed d £ {0, 1,2,.. .} we are looking for the stopping time t* G T 
such that 

Px{\Or - < d) = sup P,(|ft, -T\<d) (3) 

where 6"^ denotes the set of all stopping times with respect to the filtration 
{Tn}nefi- The parameter d determines the precision level of detection and it can 
be different for too early and too late detection. 



2.2 Construction of the optimal detection strategy 

In [17] the construction of r* by transformation of the problem to the optimal 
stopping problem for the Markov process ^ has been made, such that = 

drn-l-d,n^nn), whcrC n-l-d.7i (X r n-l~d, ■ ■ ■ ,^ rn) and Um is the 

posterior process: 

77,0 = 0, 

Brn^PAOr <n\Tn), n=l,2,... 

which is designed as information about the distribution of the disorder instant Or- 
In this equivalent the problem of the payoff function for sensor r is hr{~^r d+2, ot)- 
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3 The aggregated decision via the cooperative game 

There are various methods combining the decisions of several classifiers or sen- 
sors. Each ensemble member contributes to some degree to the decision at any 
point of the sequentially delivered states. The fusion algorithm takes into account 
all the decision outputs from each ensemble member and comes up with an en- 
semble decision. When classifier outputs are binary, the fusion algorithms include 
the majority voting [TU], [TT], naive Bayes combination [5], behavior knowledge 
space [7], probability approximation [5] and singular value decomposition [12j . 

The majority vote is the simplest. The extension of this method is a simple 
game. 

3.1 A simple game 

Let us assume that there are many nodes absorbing information and make deci- 
sion if the disorder has appeared or not. The final decision is made in the fusion 
center which aggregates information from all sensors. The nature of the system 
and their role is to detect intrusion in the system as soon as possible but without 
false alarm. 

The voting decision is made according to the rules of a simple game. Let us 
recall that a coalition is a subset of the players. Let C — {C : C C CTl} denote 
the class of all coalitions. 

Definition 1. (see U5f . JS^) A simple game is coalition game having the char- 
acteristic function, (/){■) : C — > {0, 1}. 

Let us denote W = {C d m : 0(C) = 1} and £ = {C C 01 : </>(C) = 0}. The 
coalitions in W are called the winning coalitions, and those from C are called 
the losing coalitions. 

Assumptions 2 By assumption the characteristic function satisfies the proper- 
ties: 

2. C; 

3. (the monotonicity): T C S £ C implies T E C. 

3.2 The aggregated decision rule 

When the simple game is defined and the players can vote presence or absence, 
Xi ^ I or Xi — 0, i £ oi the intruder then the aggregated decision is given by 
the logical function 



7r(a;i,X2, . . . ,Xp 



(4) 



For the logical function tt we have (cf |24| ) 



it{x 



■\ . . . ,a;P) = • 7r(a;\ . . . ,1, . . . ,xP) -\-x^ ■ 7r(x\ ... ,0, ... , 



X' 



)• 
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4 A non-cooperative stopping game 

Following the results of the author and Yasuda ^2V the multilateral stopping 
of a Markov chain problem can be described in the terms of the notation used 
in the non-cooperative game theory (see [2], [1], [13], [IS]). Let {Jtn,dn,^x), 
n = 0, 1, 2, . . . , TV, be a homogeneous Markov chain with state space (E, B). The 
horizon can be finite or infinite. The players are able to observe the Markov chain 
sequentially. Each player has their utility function fi : E — > 5R, i ~ 1, 2, . . . ,p, 
such that Ea;|/i(^i)| < oo. If process is not stopped at moment n, then each 
player, based on 5^„, can declare independently their willingness to stop the 
observation of the process. 

Definition 3. (see |24j ') An individual stopping strategy of the player i (ISS) is 
the sequence of random variables {<^n}n=i' where al^ : Q ^ {0, 1}, such that a^^ 
is ^n-Tneasurable. 

The interpretation of the strategy is following. If = 1 then player i declares 
that they would like to stop the process and accept the realization of Xn. Denote 
cr' — (ct^, cr|, . . . , ctJy) and let 6* be the set of ISSs of player i, i = 1,2,... ,p. 
Define 

6 = 6^ X ©2 X ... X e^. 

The element a — {<7^,a^, . . . , ct^)^ G & will be called the stopping strategy (SS). 
The stopping strategy tr 6 © is a random matrix. The rows of the matrix are the 
ISSs. The columns are the decisions of the players at successive moments. The 
factual stopping of the observation process, and the players realization of the 
payoffs is defined by the stopping strategy exploiting p-variate logical function. 
Let TT : {0, 1}^ — >■ {0, 1}. In this stopping game model the stopping strategy is the 
list of declarations of the individual players. The aggregate function tt converts 
the declarations to an effective stopping time. 

Definition 4. A stopping time t7r(o') generated by the SS a € & and the aggre- 
gate function tt is defined by 

t^{a) = inf {1 <n<N: 7T{alal ...,<) = 1} 

(inf(0) = oo). Since tt is fixed during the analysis we skip index n and write 
t((T) = W(a). 

We have {uj e H : W{a) = n} = (XZH^ £ ^ : '^(o'L CTfe. ■.-,<)= 0} n {w G 
f2 : 7r((7^, cr^, . . . , a^) = 1} G ^n, then the random variable t,r{o') is stopping 
time with respect to {dn}n=i- For any stopping time tTr(cr) and i G {1, 2, . . . ,p}, 
let 

f (Y ) - f h{Xn) ff t^(<T) = n, 

M^iA'T)) j limsup„^^ fdXn) if = oo 

(cf [in], [U). If players use SS ct G © and the individual preferences are con- 
verted to the effective stopping time by the aggregate rule tt, then player i gets 
/i(^t„(o-))- 
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Let V = {*a\ V2, . . . , V^)^ be fixed SS. Denote 

*a{i) = (*a\..., cr', V+\ . . . , *aPf. 

Definition 5. (of. |21| ) Let the aggregate rule n be fixed. The strategy *a = 
(*cr^, V^, . . . , V^)^ G © is an equilibrium strategy with respect to tt if for each 
i G {f , 2, . . . ,p} and any G 6' we /lawe 

E2;/j(^t„Ccr)) > ^xfl(^i^{'a{l)))- (5) 

The set of SS ©, the vector of tlie utility functions / = (/i, /2, . . . , fp) and the 
monotone rule tt define the non-cooperative game G = ((3,/,7r). The construction 
of the equilibrium strategy V G 6 in is provided in [3T] . For completeness this 
construction will be recalled here. Let us define an individual stopping set on the 
state space. This set describes the ISS of the player. With each ISS of player i 
the sequence of stopping events 13^ — {uj : al^ — 1} combines. For each aggregate 
rule TT there exists the corresponding set value function 11 : ^ ^ ^ such that 
7^(0-^,0-^, = 7r{Ici , , • • • , Ids } = ^n(Dl^,Dl.....,DZ)- For solution of the 

considered game the important class of ISS and the stopping events can be 
defined by subsets G B oi the state space E. A given set G B will be called 
the stopping set for player i at moment n if = {ui : X„ G C*} is the stopping 
event. 

For the logical function tt we have 

i i 

7r(x\ . . . = a;* • 7r(a;\ . . . ,1, . . . ,xP) + x' ■ 7r(x\ . . . ,6, . . .,xP). 
It implies that for G 5^ 

i 

[JiD" r\n{D\..., DP)}. 

Let /i, gi be the real valued, integrable (i.e. Ea;|/i(Xi)| < oo) function defined 
on E. For fixed D^j, j = 1, 2, . . . ,p, j 7^ i, and E B define 

where 'DiiA) = n{Dl, . . . , D'f^ A, Dl+\ . . . , D^) and D{ = {w : X„ G C'}. 
Let a~^ = max{0, a} and a~ — min{0, —a}. 

Lemma 1. Let fi, gi, be integrable and let £ B, j ^ 1,2, . . . ,p, j ^ i, be 
fixed. Then the set *C" = {x G E : fi{x) — gi{x) >0} £ B is such that 

i}{*C') = sup tPiC) 
c^eB 

and 

= E,(/,(Xi) - .g.(Xi))+I,^^(^) (7) 

-E,(/,(Xi) - .9,(Xi))-I,c^(,,) +E,g,;(Xi). 

Based on Lemma [T] we derive the recursive formulae defining the equilibrium 
point and the equilibrium payoff for the finite horizon game. 
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4.1 The finite horizon game 

Let horizon N be finite. If the equilibrium strategy *a exists, then we denote 
Vi,N{x) = Ea;/i(Xt(.cr)) the equilibrium payoff of i-ih player when Xq = x. For 
the backward induction we introduce a useful notation. Let <S\ = {{cr^},fc = 
n, . . ., A^} be the set ofISS for moments n < fc < iV and ©„ = ©^xS^ x. . .x©?;. 
The SS for moments not earlier than n is "cr = ("ct"'^, "cr^, . . . , 'V^) G ©„, where 
"c^' = «> • • ■ , cr^v)- Denote 

in = in(o-) = t("cr) = inf{n < fc < : 7r(CT^, ct^, . . . , cr^) = 1} 

to be the stopping time not earlier than n. 

Definition 6. The stopping strategy "*cr = ("V^, "*cr^, . . . , "V^) is an equilib- 
rium in &n if 

for every i £ {1, 2, . . . ,p}, where 



Denote 



Vi,N-ri+\{^n-\) — E:r[/i(Xf^(.^))|3'n-l] — Ex„_i /t (-'i't„ (-cr) ) ■ 

At moment n = N the players have to declare to stop and Vifi{x) = fi{x). Let us 
assume that the process is not stopped up to moment n, the players are using the 
equilibrium strategies V^, i = 1, 2, . . . at moments fc = n + 1, . . . , TV. Choose 
player i and assume that other players are using the equilibrium strategies *cr^, 
j ^ i., and player i is using strategy ct^ defined by stopping set C*. Then the 
expected payoff (pjv_„(X„_i, C*) of player j in the game starting at moment n, 
when the state of the Markov chain at moment n — 1 is Xn-i^ is equal to 

where = n{*Di, . . . , ^, ■ • ■ , *Dl). 

By Lemma [T] the conditional expected gain ifN-n{XM~m C**) attains the 
maximum on the stopping set = {x S E : — i'i.Ar-n(a;) > 0} and 

Vi^N -n+l{Xn~l) = ^x[{fi{Xn) — Vi^N ~n{Xn))'^l^*D^(n)\'Sn-l] 

-Ea;[(/i(^„) - fi,Af-n(-'^n))"I"Ll„(0)|S'n-l] (1) 
+E {Xn)\^n-l\ 

Pa;— a.e.. It allows to formulate the following construction of the equilibrium 
strategy and the equilibrium value for the game Q. 

Theorem 1. In the game Q with finite horizon N we have the following solution. 
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(i) The equilibrium value Vi{x), i = 1,2, . . . ,p, of the game Q can be calculated 
recursively as follows: 

1. Vis){x) = f^{x); 

2. For n = 1,2, N we have Px^a.e. 

Vi,n{x) = Ex[{fi{XN-n + l) — ^'i ,n- 1 + 1 ) ) li'IJ^ _ ^ (fi) ISjV- n] 

— n 1 

+ Ex[Vi,n-l{XN~n + l)\dN-n], 

for i= 1,2, p. 

(ii) The equilibrium strategy *a E & is defined by the SS of the players *cr^, 
where *a\ = 1 i/ X„ e *Q, and *Q = {x e E : fi{x) - Vi^N-n{x) > 0}, 
71 = 0, 1,...,A^. 

We have Vi{x) = v^.^ix), and Ea;/j(Xt(.^)) = Vi^N{x), i = l,2,...,p. 



5 Infinite iiorizon game 

In this class of games the equihbrium strategy is presented in Definition [5] but 
in class of SS 

e} = {a e6* : E^fr{Xt(^)) < oo for every x e E, i ^ 1,2, . . . ,p}. 
Let *cr £ 6*j: be an equilibrium strategy. Denote 

Let us assume that ("+i)*o- e is constructed and it is an equilibrium 

strategy. If players j — 1,2, ...,p, j ^ i, apply at moment n the equilibrium 
strategies , player i the strategy cr^ defined by stopping set and ("+i)*cr at 
moments n + l,n + 2, . . ., then the expected payoff of the player i, when history 
of the process up to moment n — 1 is known, is given by 

where ^*D^iA) = n{*Dl *D\~\A, *D\+\ *DP), *Di = {u e f2 : *ai = 
1}, J = l,2,...,p, J ^ z,andi^; = {w e : < = 1} = 1} = {w G 12 : X„ e C)}. 
By Lemma [T] the conditional expected gain Lpn{Xn-i, C*) attains the maximum 
on the stopping set *C,j = {x G IK : fi{x) > Vi{x)} and 

ipn{X„-l,*C') = E,[(/,(X„) - V,{Xr,)) + h,D„in)\^n-l] 

Let us assume that there exists solution {wi{x),W2{x), . . . ,Wp{x)) of the 
equations 

w,ix) = E,(/,(Xi) - w,iXi))+h.D,^a) (1) 
-E^(/,(Xi) - w;,(Xi))-L.^^(0) + E,w;,(Xi), 
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i 1,2, ... ,p. Consider the stopping game with the foUowing payoff function 
for i = 1,2, . . . ,p. 



Lemma 2. Let *a G 6^ be an equilibrium strategy in the infinite horizon game 
Q . For every N we have 



Theorem 2. Let (X„,5n, Pa;)J^Lo homogeneous Markov chain and the pay- 
off functions of the players fulfill (dP. If t* = t{*a), *a G 6^ then ^xfi{Xf ) = 

Vi{x). 

Theorem 3. Let the stopping strategy *a G &*j be defined by the stopping sets 
= {a; G E : fi{x) > Vi{x)}, i — 1,2, . . . ,p, then *a is the equilibrium strategy 
in the infinite stopping game Q . 

6 Determining the strategies of sensors 

Based on the model constructed in Sections for the net of sensors with 
the fusion center determined by a simple game, one can determine the rational 
decisions of each nodes. The rationality of such a construction refers to the 
individual aspiration for the highest sensitivity to detect the disorder without 
false alarm. The Nash equilibrium fulfills requirement that nobody deviates from 
the equilibrium strategy because its probability of detection will be smaller. The 
role of the simple game is to define wining coalitions in such a way that the 
detection of intrusion to the guarded area is maximal and the probability of false 
alarm is minimal. The method of constructing the optimum winning coalitions 
family is not the subject of the research in this article. However, there are some 
natural methods of solving this problem. 

The research here is focused on constructing the solution of the non-cooperative 
stopping game as to determine the detection strategy of the sensors. To this end, 
the game analyzed in Section 2] with the payoff function of the players defined 
by the individual disorder problem formulated in Section [5] should be derived. 

The proposed model disregards correlation of the signals. It is also assumed 
that the fusion center has perfect information about signals and the information 
is available at each node. The further research should help to qualify these real 
needs of such models and to extend the model to more general cases. In some 
type of distribution of sensors, e.g. when the distribution of the pollution in the 
given direction is observed, the multiple disorder model should work better than 
the game approach. In this case the a priori distribution of disorder moment 
has the form of sequentially dependent random moments and the fusion decision 




Vi{x). 



Let us assume that for i = 1,2, . . . ,p and every x G E we have 



Ex[sup„gN/+(X„)] < oo. 



(2) 
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can be formulated as the threshold one: stop when k* disorder is detected. The 
method of a cooperative game was used in [B] to find the best coalition of sensors 
in the problem of the target localization. The approach which is proposed here 
shows possibility of modelling the detection problem by multiple agents at a 
general level. 

7 Final remarks 

In a general case the consideration of this paper leads to the algorithm of con- 
structing the disorder detection system. 

7.1 Algorithm 

1. Define a simple game on the sensors. 

2. Describe signal processes and a priori distribution of the disorder moments 
at all sensors. Establish the a posteriori processes: 7?„ — (i7i„, . . . ,IImn), 
where 77fe„ = P{9 < n| J"„). 

3. Solve the multivariate stopping game on the simple game to get the individ- 
ual strategies of the sensors. 
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